Identifying network failures and evaluating link MTBF from utilization logs
نویسندگان
چکیده
Network failure detection techniques and link Mean Time Between Failure (MTBF) estimates are required to assess the reliability of large communication networks. We present an experience based on utilization logs from a large ISP network comprising hundreds of links, and spreading over a geographic area. The complexity of the network requires accounting also for the mutual dependencies between events on different links. Nevertheless, we show that robust non-parametric data mining methods offer a simple and effective way to accomplish the task.
منابع مشابه
Evaluating Multipath TCP Resilience against Link Failures
Standard TCP is the de facto reliable transfer protocol for the Internet. It is designed to establish a reliable connection using only a single network interface. However, standard TCP with single interfacing performs poorly due to intermittent node connectivity. This requires the re-establishment of connections as the IP addresses change. Multi-path TCP (MPTCP) has emerged to utilize multiple ...
متن کاملMTBF evaluation for 2-out-of-3 redundant repairable systems with common cause and cascade failures considering fuzzy rates for failures and repair: a case study of a centrifugal water pumping system
In many cases, redundant systems are beset by both independent and dependent failures. Ignoring dependent variables in MTBF evaluation of redundant systems hastens the occurrence of failure, causing it to take place before the expected time, hence decreasing safety and creating irreversible damages. Common cause failure (CCF) and cascading failure are two varieties of dependent failures, both l...
متن کاملTruncation error analysis of MTBF computation for multi-latch synchronizers
Chip designs have an increasing number of independent clock domains. Synchroniser circuits are used to facilitate reliable data transfers between these clock domains. The task of these synchronisers is inherently prone to the occasional, statistically random, failure. These failures are frequently quantified by the synchronisers’ Mean Time Between Failures, MTBF. The MTBF becomes worse at an ex...
متن کاملA Multi Objective Graph Based Model for Analyzing Survivability of Vulnerable Networks
In the various fields of disaster management, choosing the best location for the Emergency Support & Supply Service Centers (ESSSCs) and the survivability of the network that provides the links between ESSSCs and their environment has a great role to be paid enough attention. This paper introduces a graph based model to measure the survivability of the linking's network. By values computed for ...
متن کاملTuple switching network - When slower may be better
This paper reports an application dependent network design for extreme scale high performance computing (HPC) applications. Traditional scalable network designs focus on fast point-to-point transmission of generic data packets. The proposed network focuses on the sustainability of high performance computing applications by statistical multiplexing of semantic data objects. For HPC applications ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006